Goto

Collaborating Authors

 Macon


Data centers are amazing. Everyone hates them.

MIT Technology Review

In these politically divisive times, there's one thing we all agree on--we don't want a giant data center in our backyard. Behold, the hyperscale data center! Massive structures, with thousands of specialized computer chips running in parallel to perform the complex calculations required by advanced AI models. A single facility can cover millions of square feet, built with millions of pounds of steel, aluminum, and concrete; feature hundreds of miles of wiring, connecting some hundreds of thousands of high-end GPU chips, and chewing through hundreds of megawatt-hours of electricity. These facilities run so hot from all that computing power that their cooling systems are triumphs of engineering complexity in themselves. But the star of the show are those chips with their advanced processors.


Does Local News Stay Local?: Online Content Shifts in Sinclair-Acquired Stations

Wanner, Miriam, Hager, Sophia, Field, Anjalie

arXiv.org Artificial Intelligence

Local news stations are often considered to be reliable sources of non-politicized information, particularly local concerns that residents care about. Because these stations are trusted news sources, viewers are particularly susceptible to the information they report. The Sinclair Broadcast group is a broadcasting company that has acquired many local news stations in the last decade. We investigate the effects of local news stations being acquired by Sinclair: how does coverage change? We use computational methods to investigate changes in internet content put out by local news stations before and after being acquired by Sinclair and in comparison to national news outlets. We find that there is clear evidence that local news stations report more frequently on national news at the expense of local topics, and that their coverage of polarizing national topics increases.


Specializing General-purpose LLM Embeddings for Implicit Hate Speech Detection across Datasets

Cheremetiev, Vassiliy, Ngo, Quang Long Ho, Kot, Chau Ying, Baia, Alina Elena, Cavallaro, Andrea

arXiv.org Artificial Intelligence

Implicit hate speech (IHS) is indirect language that conveys prejudice or hatred through subtle cues, sarcasm or coded terminology. IHS is challenging to detect as it does not include explicit derogatory or inflammatory words. To address this challenge, task-specific pipelines can be complemented with external knowledge or additional information such as context, emotions and sentiment data. In this paper, we show that, by solely fine-tuning recent general-purpose embedding models based on large language models (LLMs), such as Stella, Jasper, NV-Embed and E5, we achieve state-of-the-art performance. Experiments on multiple IHS datasets show up to 1.10 percentage points improvements for in-dataset, and up to 20.35 percentage points improvements in cross-dataset evaluation, in terms of F1-macro score.


LongRAG: A Dual-Perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering

Zhao, Qingfei, Wang, Ruobing, Cen, Yukuo, Zha, Daren, Tan, Shicheng, Dong, Yuxiao, Tang, Jie

arXiv.org Artificial Intelligence

Long-Context Question Answering (LCQA), a challenging task, aims to reason over long-context documents to yield accurate answers to questions. Existing long-context Large Language Models (LLMs) for LCQA often struggle with the "lost in the middle" issue. Retrieval-Augmented Generation (RAG) mitigates this issue by providing external factual evidence. However, its chunking strategy disrupts the global long-context information, and its low-quality retrieval in long contexts hinders LLMs from identifying effective factual details due to substantial noise. To this end, we propose LongRAG, a general, dual-perspective, and robust LLM-based RAG system paradigm for LCQA to enhance RAG's understanding of complex long-context knowledge (i.e., global information and factual details). We design LongRAG as a plug-and-play paradigm, facilitating adaptation to various domains and LLMs. Extensive experiments on three multi-hop datasets demonstrate that LongRAG significantly outperforms long-context LLMs (up by 6.94%), advanced RAG (up by 6.16%), and Vanilla RAG (up by 17.25%). Furthermore, we conduct quantitative ablation studies and multi-dimensional analyses, highlighting the effectiveness of the system's components and fine-tuning strategies. Data and code are available at https://github.com/QingFei1/LongRAG.


Resilient Estimator-based Control Barrier Functions for Dynamical Systems with Disturbances and Noise

Tao, Chuyuan, Wan, Wenbin, Gao, Junjie, Mo, Bihao, Kim, Hunmin, Hovakimyan, Naira

arXiv.org Artificial Intelligence

Control Barrier Function (CBF) is an emerging method that guarantees safety in path planning problems by generating a control command to ensure the forward invariance of a safety set. Most of the developments up to date assume availability of correct state measurements and absence of disturbances on the system. However, if the system incurs disturbances and is subject to noise, the CBF cannot guarantee safety due to the distorted state estimate. To improve the resilience and adaptability of the CBF, we propose a resilient estimator-based control barrier function (RE-CBF), which is based on a novel stochastic CBF optimization and resilient estimator, to guarantee the safety of systems with disturbances and noise in the path planning problems. The proposed algorithm uses the resilient estimation algorithm to estimate disturbances and counteract their effect using novel stochastic CBF optimization, providing safe control inputs for dynamical systems with disturbances and noise. To demonstrate the effectiveness of our algorithm in handling both noise and disturbances in dynamics and measurement, we design a quadrotor testing pipeline to simulate the proposed algorithm and then implement the algorithm on a real drone in our flying arena. Both simulations and real-world experiments show that the proposed method can guarantee safety for systems with disturbances and noise.


Synergistic Perception and Control Simplex for Verifiable Safe Vertical Landing

Bansal, Ayoosh, Zhao, Yang, Zhu, James, Cheng, Sheng, Gu, Yuliang, Yoon, Hyung-Jin, Kim, Hunmin, Hovakimyan, Naira, Sha, Lui

arXiv.org Artificial Intelligence

Perception, Planning, and Control form the essential components of autonomy in advanced air mobility. This work advances the holistic integration of these components to enhance the performance and robustness of the complete cyber-physical system. We adapt Perception Simplex, a system for verifiable collision avoidance amidst obstacle detection faults, to the vertical landing maneuver for autonomous air mobility vehicles. We improve upon this system by replacing static assumptions of control capabilities with dynamic confirmation, i.e., real-time confirmation of control limitations of the system, ensuring reliable fulfillment of safety maneuvers and overrides, without dependence on overly pessimistic assumptions. Parameters defining control system capabilities and limitations, e.g., maximum deceleration, are continuously tracked within the system and used to make safety-critical decisions. We apply these techniques to propose a verifiable collision avoidance solution for autonomous aerial mobility vehicles operating in cluttered and potentially unsafe environments.


2D Convolutional Neural Network for Event Reconstruction in IceCube DeepCore

Peterson, J. H., Rodriguez, M. Prado, Hanson, K.

arXiv.org Machine Learning

IceCube DeepCore is an extension of the IceCube Neutrino Observatory designed to measure GeV scale atmospheric neutrino interactions for the purpose of neutrino oscillation studies. Distinguishing muon neutrinos from other flavors and reconstructing inelasticity are especially difficult tasks at GeV scale energies in IceCube DeepCore due to sparse instrumentation. Convolutional neural networks (CNNs) have been found to have better success at neutrino event reconstruction than conventional likelihood-based methods. In this contribution, we present a new CNN model that exploits time and depth translational symmetry in IceCube DeepCore data and present the model's performance, specifically for flavor identification and inelasticity reconstruction.


Recent neutrino oscillation result with the IceCube experiment

Yu, Shiqi, Micallef, Jessie

arXiv.org Artificial Intelligence

The IceCube South Pole Neutrino Observatory is a Cherenkov detector instrumented in a cubic kilometer of ice at the South Pole. IceCube's primary scientific goal is the detection of TeV neutrino emissions from astrophysical sources. At the lower center of the IceCube array, there is a subdetector called DeepCore, which has a denser configuration that makes it possible to lower the energy threshold of IceCube and observe GeV-scale neutrinos, opening the window to atmospheric neutrino oscillations studies. Advances in physics sensitivity have recently been achieved by employing Convolutional Neural Networks to reconstruct neutrino interactions in the DeepCore detector. In this contribution, the recent IceCube result from the atmospheric muon neutrino disappearance analysis using the CNN-reconstructed neutrino sample is presented and compared to the existing worldwide measurements.


Observation of high-energy neutrinos from the Galactic plane

Abbasi, R., Ackermann, M., Adams, J., Aguilar, J. A., Ahlers, M., Ahrens, M., Alameddine, J. M., Alves, A. A. Jr., Amin, N. M., Andeen, K., Anderson, T., Anton, G., Argüelles, C., Ashida, Y., Athanasiadou, S., Axani, S., Bai, X., V., A. Balagopal, Barwick, S. W., Basu, V., Baur, S., Bay, R., Beatty, J. J., Becker, K. -H., Tjus, J. Becker, Beise, J., Bellenghi, C., Benda, S., BenZvi, S., Berley, D., Bernardini, E., Besson, D. Z., Binder, G., Bindig, D., Blaufuss, E., Blot, S., Boddenberg, M., Bontempo, F., Book, J. Y., Borowka, J., Böser, S., Botner, O., Böttcher, J., Bourbeau, E., Bradascio, F., Braun, J., Brinson, B., Bron, S., Brostean-Kaiser, J., Burley, R. T., Busse, R. S., Campana, M. A., Carnie-Bronca, E. G., Chen, C., Chen, Z., Chirkin, D., Choi, K., Clark, B. A., Clark, K., Classen, L., Coleman, A., Collin, G. H., Connolly, A., Conrad, J. M., Coppin, P., Correa, P., Cowen, D. F., Cross, R., Dappen, C., Dave, P., De Clercq, C., DeLaunay, J. J., López, D. Delgado, Dembinski, H., Deoskar, K., Desai, A., Desiati, P., de Vries, K. D., de Wasseige, G., DeYoung, T., Diaz, A., Díaz-Vélez, J. C., Dittmer, M., Dujmovic, H., Dunkman, M., DuVernois, M. A., Ehrhardt, T., Eller, P., Engel, R., Erpenbeck, H., Evans, J., Evenson, P. A., Fan, K. L., Fazely, A. R., Fedynitch, A., Feigl, N., Fiedlschuster, S., Fienberg, A. T., Finley, C., Fischer, L., Fox, D., Franckowiak, A., Friedman, E., Fritz, A., Fürst, P., Gaisser, T. K., Gallagher, J., Ganster, E., Garcia, A., Garrappa, S., Gerhardt, L., Ghadimi, A., Glaser, C., Glauch, T., Glüsenkamp, T., Goehlke, N., Goldschmidt, A., Gonzalez, J. G., Goswami, S., Grant, D., Grégoire, T., Griswold, S., Günther, C., Gutjahr, P., Haack, C., Hallgren, A., Halliday, R., Halve, L., Halzen, F., Minh, M. Ha, Hanson, K., Hardin, J., Harnisch, A. A., Haungs, A., Helbing, K., Henningsen, F., Hettinger, E. C., Hickford, S., Hignight, J., Hill, C., Hill, G. C., Hoffman, K. D., Hoshina, K., Hou, W., Huang, F., Huber, M., Huber, T., Hultqvist, K., Hünnefeld, M., Hussain, R., Hymon, K., In, S., Iovine, N., Ishihara, A., Jansson, M., Japaridze, G. S., Jeong, M., Jin, M., Jones, B. J. P., Kang, D., Kang, W., Kang, X., Kappes, A., Kappesser, D., Kardum, L., Karg, T., Karl, M., Karle, A., Katz, U., Kauer, M., Kellermann, M., Kelley, J. L., Kheirandish, A., Kin, K., Kiryluk, J., Klein, S. R., Kochocki, A., Koirala, R., Kolanoski, H., Kontrimas, T., Köpke, L., Kopper, C., Kopper, S., Koskinen, D. J., Koundal, P., Kovacevich, M., Kowalski, M., Kozynets, T., Krupczak, E., Kun, E., Kurahashi, N., Lad, N., Gualda, C. Lagunas, Lanfranchi, J. L., Larson, M. J., Lauber, F., Lazar, J. P., Lee, J. W., Leonard, K., Leszczyńska, A., Li, Y., Lincetto, M., Liu, Q. R., Liubarska, M., Lohfink, E., Mariscal, C. J. Lozano, Lu, L., Lucarelli, F., Ludwig, A., Luszczak, W., Lyu, Y., Ma, W. Y., Madsen, J., Mahn, K. B. M., Makino, Y., Mancina, S., Mariş, I. C., Martinez-Soler, I., Maruyama, R., McCarthy, S., McElroy, T., McNally, F., Mead, J. V., Meagher, K., Mechbal, S., Medina, A., Meier, M., Meighen-Berger, S., Merckx, Y., Micallef, J., Mockler, D., Montaruli, T., Moore, R. W., Morik, K., Morse, R., Moulai, M., Mukherjee, T., Naab, R., Nagai, R., Nahnhauer, R., Naumann, U., Necker, J., Nguyen, L. V., Niederhausen, H., Nisa, M. U., Nowicki, S. C., Nygren, D., Pollmann, A. Obertacke, Oehler, M., Oeyen, B., Olivas, A., O'Sullivan, E., Pandya, H., Pankova, D. V., Park, N., Parker, G. K., Paudel, E. N., Paul, L., Heros, C. Pérez de los, Peters, L., Peterson, J., Philippen, S., Pieper, S., Pizzuto, A., Plum, M., Popovych, Y., Porcelli, A., Rodriguez, M. Prado, Pries, B., Przybylski, G. T., Raab, C., Rack-Helleis, J., Raissi, A., Rameez, M., Rawlins, K., Rea, I. C., Rechav, Z., Rehman, A., Reichherzer, P., Reimann, R., Renzi, G., Resconi, E., Reusch, S., Rhode, W., Richman, M., Riedel, B., Roberts, E. J., Robertson, S., Roellinghoff, G., Rongen, M., Rott, C., Ruhe, T., Ryckbosch, D., Cantu, D. Rysewyk, Safa, I., Saffer, J., Salazar-Gallegos, D., Sampathkumar, P., Herrera, S. E. Sanchez, Sandrock, A., Santander, M., Sarkar, S., Sarkar, S., Satalecka, K., Schaufel, M., Schieler, H., Schindler, S., Schmidt, T., Schneider, A., Schneider, J., Schröder, F. G., Schumacher, L., Schwefer, G., Sclafani, S., Seckel, D., Seunarine, S., Sharma, A., Shefali, S., Shimizu, N., Silva, M., Skrzypek, B., Smithers, B., Snihur, R., Soedingrekso, J., Sogaard, A., Soldin, D., Spannfellner, C., Spiczak, G. M., Spiering, C., Stamatikos, M., Stanev, T., Stein, R., Stettner, J., Stezelberger, T., Stokstad, B., Stürwald, T., Stuttard, T., Sullivan, G. W., Taboada, I., Ter-Antonyan, S., Thwaites, J., Tilav, S., Tischbein, F., Tollefson, K., Tönnis, C., Toscano, S., Tosi, D., Trettin, A., Tselengidou, M., Tung, C. F., Turcati, A., Turcotte, R., Turley, C. F., Twagirayezu, J. P., Ty, B., Elorrieta, M. A. Unland, Valtonen-Mattila, N., Vandenbroucke, J., van Eijndhoven, N., Vannerom, D., van Santen, J., Veitch-Michaelis, J., Verpoest, S., Walck, C., Wang, W., Watson, T. B., Weaver, C., Weigel, P., Weindl, A., Weiss, M. J., Weldert, J., Wendt, C., Werthebach, J., Weyrauch, M., Whitehorn, N., Wiebusch, C. H., Willey, N., Williams, D. R., Wolf, M., Wrede, G., Wulff, J., Xu, X. W., Yanez, J. P., Yildizci, E., Yoshida, S., Yu, S., Yuan, T., Zhang, Z., Zhelnin, P.

arXiv.org Artificial Intelligence

The origin of high-energy cosmic rays, atomic nuclei that continuously impact Earth's atmosphere, has been a mystery for over a century. Due to deflection in interstellar magnetic fields, cosmic rays from the Milky Way arrive at Earth from random directions. However, near their sources and during propagation, cosmic rays interact with matter and produce high-energy neutrinos. We search for neutrino emission using machine learning techniques applied to ten years of data from the IceCube Neutrino Observatory. We identify neutrino emission from the Galactic plane at the 4.5$\sigma$ level of significance, by comparing diffuse emission models to a background-only hypothesis. The signal is consistent with modeled diffuse emission from the Galactic plane, but could also arise from a population of unresolved point sources.


Certified Robust Control under Adversarial Perturbations

Yang, Jinghan, Kim, Hunmin, Wan, Wenbin, Hovakimyan, Naira, Vorobeychik, Yevgeniy

arXiv.org Artificial Intelligence

Autonomous systems increasingly rely on machine learning techniques to transform high-dimensional raw inputs into predictions that are then used for decision-making and control. However, it is often easy to maliciously manipulate such inputs and, as a result, predictions. While effective techniques have been proposed to certify the robustness of predictions to adversarial input perturbations, such techniques have been disembodied from control systems that make downstream use of the predictions. We propose the first approach for composing robustness certification of predictions with respect to raw input perturbations with robust control to obtain certified robustness of control to adversarial input perturbations. We use a case study of adaptive vehicle control to illustrate our approach and show the value of the resulting end-to-end certificates through extensive experiments.